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By Thomas P. Hayes^ and Alistair Sinclair^ 

Toyota Technological Institute and University of California at Berkeley 

We prove that any Markov chain that performs local, reversible 
updates on randomly chosen vertices of a bounded-degree graph nec- 
essarily has mixing time at least r2(nlogn), where n is the number 
of vertices. Our bound applies to the so-called "Glauber dynamics" 
that has been used extensively in algorithms for the Ising model, 
independent sets, graph colorings and other structures in computer 
science and statistical physics, and demonstrates that many of these 
algorithms are optimal up to constant factors within their class. Pre- 
viously, no superlinear lower bound was known for this class of algo- 
rithms. Though widely conjectured, such a bound had been proved 
previously only in very restricted circumstances, such as for the empty 
graph and the path. We also show that the assumption of bounded 
degree is necessary by giving a family of dynamics on graphs of un- 
bounded degree with mixing time 0{n). 

1. Introduction. A large fraction of Markov chain Monte Carlo algo- 
rithms, as studied in both theoretical and practical settings, fall into the class 
of "reversible single-site dynamics," or "Glauber dynamics," on bounded- 
degree graphs. In this class of algorithms, one is given a finite undirected 
graph G = {V, E) of maximum degree A and a finite set of values Q, together 
with a Markov chain whose state space is C Q^, a subset of all possible 
assignments of values to the vertices of G. (Following the terminology of 
spin systems in statistical physics, which is one of the major application 
areas, we shall refer to vertices as "sites," values as "spins" and assignments 
as "configurations." Note that we do not assume that all configurations are 
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allowed, i.e., the spin system may have so-called "hard constraints.") The 
Markov chain is designed to sample from a given probability distribution vr 
on Q. At each step, it selects a site v €V uniformly at random and modifies 
the spin at v according to a randomized update rule; this rule is required 
to be local, in the sense that it depends only on the current spins at v and 
its neighbors in G, and reversible with respect to vr. (See Example 5 in 
Section 2 for precise definitions.) Under mild additional assumptions, these 
conditions ensure that the Markov chain is ergodic and converges to the sta- 
tionary distribution tt. Thus, an algorithm which starts from an arbitrary 
initial configuration and performs sufficiently many random spin updates 
samples configurations (approximately) from vr. 

While our framework is rather more general, the most common scenario 
is when vr is a Markov random field; that is, for any subset of sites U '^V, 
when the spins outside U are fixed, the conditional distribution over config- 
urations inside U depends only on the spins on the boundary, dU . Markov 
random fields are ubiquitous in statistical physics (where they describe sys- 
tems of particles with nearest-neighbor interactions) and in statistical mod- 
eling (where they capture conditional independence relations among random 
variables). For Markov random fields there is a particularly natural update 
rule called the "heat-bath" rule (or "Gibbs sampler" ) , in which the spin at v 
is replaced by a random spin chosen from the correct distribution conditional 
on the spins of its neighbors. Many other variants (such as the "Metropolis 
rule" — see Section 2) are possible. 

Clearly, the efficiency of the Markov chain Monte Carlo algorithm depends 
on the rate of convergence to equilibrium, or "mixing time" of the Glauber 
dynamics. This is the number of steps required until the distribution of the 
Markov chain is close (in total variation distance) to the stationary distri- 
bution vr. In recent years, motivated by applications in statistical physics, 
combinatorics and artificial intelligence, much progress has been made in 
deriving upper bounds on the mixing time. In many cases, it has been pos- 
sible to show that the mixing time is as fast as O(nlogn), where n is the 
number of sites in G. Celebrated examples include the two-dimensional Ising 
model above the critical temperature [14], graph colorings with sufficiently 
many colors [13] and the hard-core model (independent sets) at sufficiently 
low densities [17]. An upper bound of 0(n log n) arises naturally from vari- 
ous techniques for bounding the mixing time, such as coupling [2] and the 
log-Sobolev constant [7] and, at least among physicists, is generally taken 
as the criterion for "rapid mixing" (rather than the weaker notion of being 
bounded by a polynomial in re, which is more common in computer science). 

The generally accepted "folklore" about these results is that they are 
optimal, in the sense that the mixing time of Glauber dynamics can never 
be o(n log re). The justification for this folklore is that, by a standard coupon 
collecting argument, after substantially less than nlnn steps a significant 
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number of sites have never been updated from their initial values, so mixing 
cannot have occurred. However, converting this intuition into a rigorous 
proof has turned out to be elusive except in very restricted cases, namely 
when G is the empty graph (i.e., all spins are completely independent) [2, 
6, 8], and when G is a path and tt the uniform distribution over proper q- 
colorings {q > 3) [9] . This is actually a manifestation of a relative lack of tools 
for proving lower bounds on the mixing time of Markov chains, in contrast 
to the wide range of techniques for upper bounds that is now available. (The 
standard tool is to identify a "bottleneck" in the chain, and has been used 
to obtain exponential lower bounds on mixing times in specific cases; see, 
e.g., [5, 10, 12].) 

To get a feel for why this issue is more subtle than it may seem at first 
glance, consider the very simple case in which G is the empty graph on 
n vertices, the spins are Q = {0,1}, and the update rule replaces the spin 
at the chosen site by the outcome of a fair coin toss. Thus in the stationary 
distribution vr all spins are i.i.d. uniform. By coupon collecting, the number 
of steps needed to update all the sites is, with high probability, nlnn (plus 
lower-order terms). However, it is well known [2] that only about ^nlnn 
steps suffice to mix in this case, so the coupon collecting analogy fails. The 
intuition behind this bound is that after In n + uj{n) steps only o(n^/^) 
of the spins will not have been updated, which is of lower order than the 
standard deviation of the number of 1-spins in the stationary distribution. 
Hence the effect of the untouched spins is statistically undetectable in total 
variation. A further twist on this example yields a more surprising outcome. 
Suppose we modify the update rule so that the chosen spin is ffipped with 
probability 1 — and left unchanged with probability Remarkably, 
it turns out [2, 6] that the mixing time for this modified dynamics is only 
|nlnn + 0{n), after which time 0(n^/^) of the sites have not yet been 
updated! The reason for this discrepancy, of course, is that the update dis- 
tribution is not uniform: by judiciously choosing the probability of ffipping 
the spin, we have accelerated mixing. In this case the speedup is only by a 
constant factor. But, as this simple example indicates, it is quite plausible 
that in a more complex setting with nontrivial spin interactions it may be 
possible to tune the Glauber dynamics so as to achieve o(nlogn) mixing 
time. 

In this paper we prove, in a very general setting, that this is not possible. 
Specifically we show that any nontrivial Glauber dynamics on any graph G 
of bounded degree must have mixing time r2(n log n). (The constant con- 
cealed in the here depends only on the maximum degree A of G.) 
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Theorem 1.1. Let A > 2 be a fixed positive integer, and let G be any 
graph on n vertices of maximum degree at most A. Any nonredundant'^ 
Glauber dynamics on G has mixing time ri(nlogn), where the constant in 
the il(-) depends only on A. 

We note that Theorem 1.1 apphes, in particular, to the standard heat- 
bath dynamics for ah widely studied nearest-neighbor systems in statistical 
physics, including the Ising model, the hard-core model, and graph colorings, 
with arbitrary boundary conditions. 

Our lower bound actually holds in somewhat greater generality. In par- 
ticular, it applies to so-called "block dynamics," a variant of Glauber dy- 
namics in which at each step a randomly chosen constant-size block of sites 
(rather than just a single site) is updated, again via a local, reversible rule; 
moreover, the assumption that the site to be updated is chosen uniformly 
can be relaxed to the requirement that no site is chosen with probability 
greater than 0{\/n) (no lower bound on the site selection probabilities is 
needed). On the other hand, the assumption of bounded degree is essen- 
tial: we complement Theorem 1.1 by exhibiting a Glauber dynamics on a 
family of n-vertex graphs of unbounded degree for which the mixing time 
is only 0(n). In fact, we prove the following trade-off between maximum 
degree and the mixing time: 

Theorem 1.2. For each n, let A(n) be any natural number satisfying 
2 < A(n) < n. Then there exists a family of graphs G = G{n), where G{n) 
has n vertices and maximum degree A(n), and an associated Glauber dy- 
namics on G{n) with mixing time 0(n log n/ log A (n)). 

The two main ingredients in the proof of Theorem 1.1 are "disagree- 
ment percolation," which bounds the rate at which information can flow 
between sites in the Glauber dynamics as a function of their distance, and 
"complete monotonicity," which implies that, for a given site, under suitable 
initial conditions, the probability of the site having its initial spin decreases 
monotonically with time. We use these tools, together with a two-stage cou- 
pling argument, to identify initial conditions for the dynamics which have 
a statistically observable effect that persists for il(nlogn) steps. A similar 
overall strategy was used by Dyer, Goldberg and Jerrum in their study of 
the Glauber dynamics for colorings of a path [9]. However, in their case the 
simplicity of the graph and the restriction to colorings made it possible to 



^The nonredundancy condition merely requires that for every site there are at least two 
distinct spin values that appear in some feasible configuration; see Section 2.1. Without 
this condition, we could simply pad the graph with an arbitrary number of "frozen" sites 
without increasing the mixing time per site. 
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do the calculations explicitly, whereas in our general setting we have to re- 
place those calculations with general arguments based on percolation and 
monotonicity in arbitrary graphs with arbitrary spin systems. 

The remainder of the paper is organized as follows. In the next section we 
introduce some basic terminology and background, including a translation 
from discrete- to continuous-time Glauber dynamics which allows us to use 
the latter for convenience in the rest of the paper. In Section 3 we develop our 
principal tools: disagreement percolation and complete monotonicity; and in 
Section 4 we use them to prove our main theorem, a lower bound of i}{n log n) 
for the Glauber dynamics on any bounded-degree graph. Finally, in Section 5 
we discuss various extensions of our main result, and give examples which 
show that the assumptions of bounded degree and of (moderately) uniform 
random site selection are necessary. 

2. Preliminaries. 

2.1. Glauber dynamics. Let G = {V,E) be an n-vertex undirected graph 
of maximum degree at most A > 2, and let Q = {1, . . . ,q} be a finite set of 
spins. We refer to the vertices of G as sites, and we identify (induced) sub- 
graphs of G with their sets of sites. A configuration on G is an assignment 
a:V ^ Q of spins to sites. We specify an arbitrary set O C of configu- 
rations, which we call feasible. If not all configurations are feasible, we say 
that the associated spin system has "hard constraints." 

We will use the term "Glauber dynamics" to refer to any ergodic Markov 
chain on that, at each step, picks a site v £V uniformly at random and 
applies a local, reversible update to the spin at v. More precisely, in (feasible) 
configuration a Gil it does the following: 

1. pick V GV uniformly at random (u.a.r.); 

2. replace the spin a{v) = s by a new spin s' chosen with probability K(7,i)(s, s'), 
resulting in a new feasible configuration a' . 

The update distributions Ka,v{s, •) are required to be local, that is, to depend 
only on the values of o" at v and its neighbors, and reversible with respect to 
some probability distribution vr that assigns nonzero weight to every a gQ, 
that is, to satisfy the detailed balance conditions 



The most common scenario in applications is when the distribution vr is 
a "Markov random field" (MRF) on G. This means that, for all subsets 
U C.V, when the configuration outside U is fixed, then the conditional dis- 
tribution on the configuration inside U depends only on the spins on the 
boundary of U, that is, on dU = {v E V \ U -.Bu £ U with {u, v} £ E}. Thus 
in particular the conditional distribution TTa,v{') of the spin at v depends 
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only on the spins cr{u) at the neighbors u of v. This admits the fohowing 
two very natural local update rules that are reversible with respect to vr: 

1. Heat-bath dynamics (or Gibbs sampler). Set Ka,vis,s') = TT^^vis'), that 
is, the new spin at v is chosen from the conditional distribution given the 
spins at the neighbors of v. 

2. Metropolis dynamics. Set Ka-^v{s, s') = minj^^vE^^ ij. for s / s', and 
Ha,v{s,s) = 1 — J2s'^s'^o;v{Sts'). That is, choose s' £ Q u.a.r., and "accept" 
this choice with probability that depends on the ratio TTcr,v{s')/iT^^v{s), oth- 
erwise leave o'{v) unchanged. 

Note, however, that our class of dynamics includes examples that are not 
MRF's (see Example 5 in Section 2.2). 

We always assume that the Glauber dynamics is irreducible, that is, for 
any two feasible configurations o", r, we have P^{a,T) > for some finite t, 
and aperiodic, that is, for all feasible cr, r, we have gcd{t : P*((j, r) > 0} = 
1. (Note that aperiodicity is essentially a trivial requirement, and can be 
enforced by introducing a uniform self- loop probability everywhere.) These 
two conditions, together with reversibility, ensure that the Glauber dynamics 
(Xf)^Q converges to the stationary distribution tt as t ^ oo, for any feasible 
initial configuration Xq. We measure the rate of convergence by the mixing 
time. 



where || • || denotes total variation distance^ and (with some abuse of nota- 
tion) we identify the random variable Xt with its distribution. The constant 



for all natural numbers k. 

Finally, we will always assume that the Glauber dynamics satisfies the 
following nonredundancy condition: for every site there are at least two 
distinct spin values that appear in some feasible configuration. Without 
such a condition, the system could be artificially padded with an arbitrary 
number n' of additional "frozen" sites while increasing the mixing time by 
a factor of only {n + n')/n. Equivalently, the mixing time of the continuous- 
time dynamics (see Section 2.3) would be unchanged. 



^The total variation distance between two probability distributions fi,rj on a, finite set 
is defined by ||^ - t]\\ ~ \ J^^en l'^(^) ^ ''(^)l- 
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2.2. Examples. We now present a few well-known examples that fit into 
the above framework. The first four are Markov random fields, so the heat- 
bath or Metropolis dynamics applies to them (and both are completely spec- 
ified by the distribution vr). The final example shows that the framework also 
includes systems that are not MRFs. 

1. The Ising model. Here Q = {±1}, and the distribution over configura- 
tions is given by 



where P is the inverse temperature and h is the external field. There are no 
hard constraints. This distribution assigns large weight to configurations in 
which many neighboring spins are aligned, and many spins have the same 
sign as h. The amount by which such configurations are favored is deter- 
mined by (3. Plainly this is an MRF as the conditional probability of a 
configuration on any subset U depends only on the spins on the neighbors 
of U^. We note also that this (and the other models we consider) can be 
extended to the case of arbitrary boundary conditions, that is, fixed spins on 
certain sites of G. More precisely, to do this we remove the sites with fixed 
spins (which are redundant) and incorporate the effects of these spins into 
the potentials on the neighboring sites. 

2. The hard-core model {independent sets). Here Q = {0, 1}, and we call 
sites with spins and 1 "unoccupied" and "occupied" respectively. The 
feasible configurations a are those in which no two adjacent sites are occu- 
pied, that is, they are just the independent sets in G; thus there are hard 
constraints. A feasible configuration with k occupied sites is assigned prob- 
ability proportional to A*^, where the parameter A > controls the density 
of occupation. 

3. Graph colorings {the zero-temperature antiferromagnetic Potts model). 
Here Q = {1, . . . ,q} is thought of as a set of colors, and configurations as 
vertex colorings of G. The feasible configurations are proper colorings, that 
is, those in which no two adjacent vertices receive the same color. The dis- 
tribution vr is uniform over proper colorings. Note that the heat-bath (or 
Metropolis) dynamics may not be irreducible in this case; for instance, there 



Note that this MRF is completely specified by "potentials" on the sites and edges of G, 
that is, it is a "Gibbs distribution" familiar from statistical physics. [Here the site poten- 
tials are (^„(o-(w)) = ~hl3a{i}) for all v £V, and the edge potentials are <l)uv{o'{u),a{v)) — 
—(3a{u)a{y) for all {u,v} € E] The Hammersley-ClifTord theorem (see, e.g., [4]) ensures 
that such a representation — generalized to include a potential function (pc for each clique C 
of G — is in fact always possible for an MRF with no hard constraints. If the potentials 
are allowed to approach infinity, the same holds for MRF's with hard constraints under 
additional assumptions; see [15]. 



vr 
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may exist proper colorings that are "frozen," in the sense that no single site 
can have its color changed (even though other proper colorings do exist). 
However, it is easy to check that if there are sufficiently many colors, specif- 
ically if g > A + 2 where A is the maximum degree of G, then the dynamics 
is ergodic over all proper colorings. 

4. Constraint satisfaction problems. Let {xi, . . . ,Xm} be a set of Boolean 
variables, and let Ci, . . . , be a set of constraints; that is, each Cj speci- 
fies, for some subset of the variables, a set of allowed combinations of truth 
values for those variables. (A canonical example is the Satisfiability prob- 
lem fc-SAT, in which each constraint is of the form V V • • • V Zi^, , where 
each Zi. denotes either a variable or its negation.) An associated MRF is 
defined as follows. The graph G has one site for each variable, and an edge 
between a pair of variables iff they appear together in some constraint. The 
spin values are Q = {True, False}, and feasible configurations are truth as- 
signments that satisfy all the constraints. The distribution vr is uniform over 
feasible configurations. In this example the Glauber dynamics may again fail 
to be irreducible; this is frequently overcome by considering a "soft" version 
in which all configurations are feasible but are penalized according to the 
number of violated constraints they contain. 

5. A non-Markovian example. It is not hard to construct instances of 
Glauber dynamics in which the stationary distribution vr is not an MRF. 
For a simple example, let G be the cycle on n vertices, and Q = {0,1}. 
Feasible configurations are those in which the set of spin-0 sites form a 
contiguous block of size 1 < i < n — 1. The update rule for the spin at any 
site V is the following: if the spins at the two neighbors of v differ, then flip 
the spin at v, else leave it unchanged. It is easy to check that this dynamics 
is ergodic and reversible with respect to the uniform distribution on feasible 
configurations. However, this distribution is clearly not a MRF. 

2.3. Continuous-time dynamics. It will be convenient in our proofs to 
work with a continuous-time version of the Glauber dynamics, which we 
now describe. Given the discrete-time dynamics (X^)^q, the continuous- 
time version (Xf )t>o is defined as follows: 

• there is an independent, rate-1 Poisson clock associated with each site; 

• when the clock at v rings, the spin at v is updated as in the discrete-time 
dynamics. 

This process can be viewed equivalently as follows, where n = \ V\: 

• there is a single, rate-n Poisson clock associated with G; 

• when the clock rings, a site v is chosen u.a.r. and an update is performed 
at V. 
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It is easy to verify that in distribution is equal to X'^^^^y where N{nt) 
is the number of rings of a rate-n Poisson clock in time interval t. In other 
words, for every t>0, 

(3) Xfife-'^^X^ 



oo 

I 

s=0 



The mixing time for the continuous dynamics is defined by analogy with (1) 



as 



(4) T^i,:=maxinf|t:||Xf-7r||<^|. 

The following observation guarantees that lower bounds on the mixing time 
translate from the continuous-time to the discrete-time setting. Hence we 
may work in continuous time without loss of generality. 

Proposition 2.1. For every t>0, \\Xf - 7r|| > ||^2t/n ~ ^11 ~ 

Proof. Note first that by (3) and the triangle inequality we may write 

oo 

U2t/n - ^11 < E Pr(A^(2t) = s)\\Xf - nl 

s=0 

Splitting the range of summation at t, and using the fact that \\Xf — 7r|| is 
a decreasing function of s, we find 

- vrll < Pr(iV(2t) <t) + {{Xj" - 7t\\. 

The desired result follows because, for every t > 0, Pr(A^(2t) <t)< 2e~*, an 
elementary property of the Poisson distribution. □ 

Corollary 2.2. The mixing times for the discrete- and continuous- 
time dynamics satisfy 

' mix ^ „ 
O 

Proof. Set Xq to achieve the maximum in (4). Recall from (2) that 

II 3Tniix II g3 ' 

Hence, by Proposition 2.1 applied with t = 3rmix, and since Tmix > 1, 

n r ,,1 ,^3 1 

Xg . /„ - vr < ^ + 2e-3"-- < ^ < TT 



which implies t^^^ < Grmix/"-- D 
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Corollary 2.2 says that any lower bound on the mixing time of the continuous- 
time dynamics translates immediately to discrete time, with the loss of only 
a small constant factor. The translation involves a uniform scaling factor 
of n, arising from the differing clock speeds: while the discrete-time dynam- 
ics hits any given site only about once in n steps, the continuous-time version 
updates the sites at a constant rate. 

3. Basic ingredients. In this section we introduce two basic tools that 
will play important roles in our proof. The first bounds the rate at which in- 
formation can percolate in the Glauber dynamics, and the second expresses 
a useful monotonicity property of the probability of being in a certain set of 
configurations. Neither of these is new per se, but our application requires 
some refinements of them (notably. Lemmas 3.4 and 3.5). We work through- 
out in continuous time; Section 2.3 shows how to adapt our results to the 
discrete-time setting. 

3.1. Disagreement percolation. Suppose {Xt) and (Yt) are two copies of 
the Glauber dynamics which agree at time except on some subset A of 
the sites. Let A' be another subset of sites at distance d from A (so that, 
in particular, Xq and Yq agree on A'). If t is not too large, then we would 
expect the distributions of the spin configurations on A' in Xt and Yt not 
to differ too much. The extent to which they differ as a function of t is a 
measure of the rate of information flow in the dynamics, since any difference 
must be caused by a disagreement percolating from A to A' . 

We can bound this effect by coupling the evolutions of Xt and Yt. Specifi- 
cally, we will use a greedy coupling: we make the two processes use the same 
Poisson clocks and, whenever the clock at site v rings, we update it to the 
same value in both processes with the largest possible probability. Thus in 
particular, if at the time the clock rings Xt and Yt agree on v and all its 
neighbors, then this will also hold with probability 1 after the update. 

The following lemma bounds the probability that a disagreement per- 
colates from A to A' under this coupling. For each S G V, recall that 
dS := {w & V \ S ■.3u G S with {u, w} G E} denotes the boundary of S, and 
define the "internal boundary" of S by 5S := d{V \ S). (Note that 6S C S.) 

Lemma 3.1. Let (Xt) and (Yt) be continuous-time Glauber dynamics on 
a graph G = {V,E) of maximum degree at most A. Suppose Xq = Yq at all 
sites in V \A. Let A' CV be a set of sites at distance d> from A. Then 
the greedy coupling of (Xt) and (Yt) satisfies 

(5) Pr(Xi = Yt on A')>1- mm{\5A\,\6A'\} 

Moreover, the same conclusion holds even if the spin update probabilities of 
(Xt) and (Yt) differ at sites in A. 
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We will often apply Lemma 3.1 in the situation where A' = Br{v) (the 
ball of radius r in G centered at v), and A = V\ Bji^i{v) for > r [so that 
the evolutions of {Xt) and (Yt) in the ball are the same except for 

possible effects originating from the boundary Sji{v) := dBji-i{v)]. In this 
case, the bound in (5) implies 



The proof of Lemma 3.1 relies on the following simple fact. 

Observation 3.2. Let t >0 and r > 1. Consider r fully independent 
Poisson clocks of rate 1. Then the probability, p, that there is an increasing 
sequence of times < ti < ■ ■ ■ < tr < t such that clock i rings at time ti 
satisfies 



Proof. Since the waiting times between each successive pair of rings of 
a single clock are independent exponential random variables with mean 1, 
the event in the lemma has the same probability as the event that a single 
rate-1 Poisson clock has at least r rings by time t. The probability of this 
event is thus 



The second inequality here follows because r! > {"^Y for r > 1. □ 

Proof of Lemma 3.1. By the initial conditions and the properties 
of the greedy coupling, Xt and Yj can differ at a site in A' only if there 
exists a "path of disagreement" from A to A' . This is a path vi,. . . ,Vr in G 
from a site vi S 5 A to a site Vr G 5^4', together with a sequence of times 
t\ < ■ ■ ■ <tr <t such that an update is performed at Vi at time ti. Since all 
such paths have length at least d, we may restrict attention to their initial 
segments of length d. For any such initial segment, by Observation 3.2 the 
probability that the corresponding updates are performed is bounded above 
by (^)'^. The number of such segments is clearly at most I^^IA"^. We may 
also replace 5 A by 5 A' by considering the paths in reverse order. A union 
bound now completes the proof. Finally, note that the argument did not 
rely on any properties of the update probabilities for sites in A. □ 

We now derive two further lemmas which use similar ideas but which 
apply only to dynamics with hard constraints. Call a site frozen in a given 
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configuration if its current spin is the only feasible value given the spins on 
its neighbors. These lemmas bound the rate at which a set of frozen spins 
can become unfrozen, or vice versa. 

Lemma 3.3. Let (Xt) be continuous-time Glauber dynamics on a graph 
G= {V,E) of maximum degree at most A. Suppose Xq is frozen except at 
sites in AcV. If A' CV is a set of sites at distance d> from A, then 



Proof. Same as the proof of Lemma 3.1. □ 

Lemma 3.4. Let (Xf) be continuous-time Glauber dynamics on a graph 
G = {V,E) of maximum degree at most A. Suppose A,A' cV are sets of 
sites at distance d> from each other. Then 



Proof. Label each site with the last time in the interval [0,t] that its 
spin actually changed. If Xt is frozen on V \A, then for any changed site 
there must be an increasing sequence of such times along a path from that 
site to A. Thus if in addition Xt is not identical to Xq on A' , there must be 
such an increasing sequence of times along a path from A' to A. The result 
now follows in the same way as in the proof of Lemma 3.L □ 

3.2. Monotonicity properties. In this section, we will prove the following 
monotonicity result for Glauber dynamics, which essentially says that, under 
suitable initial conditions, the probability of a particular site retaining its 
initial spin after t steps decays slowly as a function of t. For this property, we 
will relax our usual assumptions and require only that the Glauber dynamics 
is reversible, not necessarily ergodic. This will be the second key ingredient 
in the proof of Theorem 1.1 in the next section. 

Lemma 3.5. Let G = (V, E) be a graph, let v eV, and let C Q. Con- 
sider any continuous-time Glauber dynamics on G with spin space Q, re- 
versible with respect to a distribution vr over $7 C . Let fi be the proba- 
bility, under vr, that the spin at v is in Q^, and suppose < fx < 1. Sample 
the initial configuration Xq according to n, conditioned on the event that 
Xq{v) G Q^. Then, for every t>0. 



Pr{Xt = Xo on A')>1- mm{\5A\,\SA'\} 




Pr{{Xt is frozen onV\A) and not (Xt = Xq on A')) 




Pr{Xt{v) e Q.a) >i2 + {l-fi) exp(-t/(l - /i)). 
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Lemma 3.5 can be derived from a more general monotonicity property 
of reversible Markov chains. Following Aldous and Fill ([3], Chapter 3, Sec- 
tion 4), call a function / completely monotone decreasing {CMD) if it can 
be written in the form 

m 

(7) /(t) = 5]aiexp(-Ait), 

i=l 

where all the coefficients aj,Ai are nonnegative. 

Lemma 3.6. Let {Xt) he a continuous-time Markov chain on finite state 
space VL, and let {Xt) he reversible with respect to a distribution n. Let ^ cil.. 
If the distribution of Xq is supported on ^, and proportional to vr on ^, then 
Fr{Xt £ ^) is CMD. Indeed, 

\n\ 

Pr (Xj G ^) = ^ a, exp(- Ait) , 
1=1 

where J2i oti = 1? ai = vr(^'), Ai = and all a,, Aj > 0. 

As mentioned earlier, this (and indeed more general) monotonicity prop- 
erties for both continuous- and discrete-time Markov chains are well known 
(see, e.g., [3], Chapter 3, Section 4). We include a proof for completeness. 

Proof of Lemma 3.6. As stipulated in the lemma, we let {Xt) have the 
initial distribution /-io(^) = vr(i)/7r(^') for i G \I' (and elsewhere). Writing 
Pt{i,j) for the time-t transition probability from i to j, our main goal is to 
show that the quantity 

(8) Pr{Xt e^)=Y.j2 Mi)mj) = ^{^r' E E ^mi^J) 

is CMD. 

By the standard spectral representation for reversible Markov chains, we 
may write 

Pt{h3) = \ —rrr E ^^P{-^kt)uikUjk, 

where = Ai < A2 < • • • < are the eigenvalues of the transition kernel, 

and (ujj)[^L;^ is an orthonormal matrix. Moreover, we may take un = ^J^T{i), 
since vr is a stationary vector for the Markov chain. Substitution into (8) 
gives 



Vr{Xt £^)= ^{^) ^E'5^P(-^fc*) E E \J ^{i)T^{j)uikUjk 

k ie^jG* 
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2 



(9) 



A: Vie'I' / 



This is clearly of the form (7), so the function is CMD. Moreover, ai satisfies 

2 



Vie* / 



0G^) = 1. ^ 



Vie* 

and by (9) the sum of all coefficients is 

mi / X2 

k=l k ViG'I' / 

We now prove the main result of this section. 

Proof of Lemma 3.5. Let {Xt) be as described in the lemma, and 
define f{t) := 'Pr{Xt{y) G Qv)- By Lemma 3.6, we may write 

f{t) = ^aiexp{-Xit), 

i=l 

where X)!='i = 1, ai= fi, Ai = and all a.j, Aj > 0. 

Now, let us sample J from l,...,|r2| according to the distribution {aj). 
Note that, in this case, 

fit) = E(exp(-Ajt)) = /X + (1 - /i)E(exp(-Ajt) | J > 2). 

Since the function exp(— x) is convex, we may apply Jensen's inequality to 
the conditional probability on the right-hand side to obtain 

fit) >^^ + il-^i) exp(-tE(Aj | J > 2)). 

Observe that 

E(A,|J>2) = -^f:a,A, = ^:^. 

Moreover, for every s > 0, 

/(s) > Pr(clock at v does not ring by time s) = , 
and hence /'(O) > —1, which completes the proof. □ 
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4. Main result. We are now ready to state and prove the continuous-time 
version of our main result. 

Theorem 4.1. Let A > 2 be a fixed positive integer, and let G be any 
graph on n vertices with maximum degree at most A. Any continuous-time 
Glauber dynamics on G has mixing time r2(logn), where the constant in the 
O(-) depends only on A. 

By Corollary 2.2, Theorem 4.1 immediately implies that the discrete-time 
Glauber dynamics has mixing time at least $7(77, log n), thus proving our main 
result (Theorem 1.1) claimed in the Introduction. 

We prove Theorem 4.1 in two steps. First, in Section 4.1, we consider the 
conceptually simpler case in which there are no hard constraints. Then, in 
Section 4.2, we show how to extend the proof to handle hard constraints. 

4.1. Proof for the case of no hard constraints. Let T= g^j^- We will 
show that, for a suitable initial distribution Xq, the variation distance be- 
tween Xx and the stationary distribution is large and hence r^^j^ > T. In 
the remainder of the proof, we will assume that A > 2 is fixed and 77 is 
sufficiently large. 

Let R = Ij^^] ■ Choose a set of sites G CV whose pairwise dis- 

tances are all at least 2R. (Such a set G can always be found by repeatedly 
picking a site, adding it to C, and removing it from V, together with all 
sites at distance < 2R — 1.) We call the elements of G centers, since we will 
shortly be considering a restriction of the dynamics to balls of radius R — 1 
centered at elements of G. 

For each site v €G, let Q^, be a nonempty proper subset of the spins. (For 
instance, they could all be the singleton {1}.) For every configuration X, let 
f{X) denote the fraction of centers v such that ^(t;) S Qy. We will specify 
a distribution for Xq and a threshold /i, such that 

Pr(/(XT)>M)- Pr if{X)>fl) 

where vr is the stationary distribution. This implies that the mixing time is 
greater than T. 

Let U denote the set of sites at distance > R from C, and let ajj be an 
arbitrary assignment of spins to all sites in U. The distribution of Xq will 
be concentrated on configurations which agree with ajj, and satisfy either 
(yv G G){Xq{v) £ Qv), or (Vt; G G){Xq{v) ^ Qy) (we will specify which in 
the next paragraph). Among configurations satisfying these constraints, the 
distribution of Xq will be proportional to vr. 



> 



2e' 
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Let /i denote the conditional expectation of f{Y) when Y is drawn from 
the stationary distribution vr, conditioned on agreeing with ajj. Let e = 
l/(4exp(2r)), and define /i G [0, 1] by 

( fi-e, if i^L> 1/2, 

(10) n={l/'2, if ^1 = 1/2, 

[fi + e, if^Kl/2. 

Let X ~ vr. If Pr(/(X) > /i) < 1/2, then we require that (Vv G C){Xo{v) G 
Qy). Otherwise, we require that (Vv G C)(Xo(u) ^ Qv)- Replacing by 
its complement if necessary, we may assume without loss of generality that 
Pr(/(X) > /2) < 1/2 and {^v G C){Xoiv) G 

With this definition of Xq, it suffices to show that 

(11) Pr(/(XT)</i)<^-f . 

To this end, we introduce a second chain, Yq, . . . ,Yt, which evolves ac- 
cording to the Glauber dynamics except that only the spins on the balls 
[j^^Q Bji-i{v) are updated. (The sites outside these balls retain their ini- 
tial configurations and ignore their clocks.) Note that this chain decomposes 
into fully independent processes on each of the balls Br-i{v). For the initial 
distribution, we take Yq = -'^o- 

We will first show that, under the greedy coupling of {Xt) with (It), the 
expected number of centers G C at which Xt and Yt differ is small, and 
hence that 

(12) Pr(/(XT) < /(Ft) - e/2) < 1/4. 

Then we will take advantage of the independence of the spins Yt{v),v G C, 
to show that fiYx) is concentrated around its expectation and hence also 
that 

(13) Pr(/(yT)</2 + e/2)<l/4-l/(2e). 

Combining inequalities (12) and (13) yields (11) and thus will complete the 
proof. 

Fix V £ C. Since Xq = Yq, and the update rules for the two chains are the 
same on Bji-i{v), it follows from Lemma 3.1 and equation (6) (with r = 0) 
that, under the greedy coupling, 

PriXriv) + Yt{v)) < < e/8. 

The second inequality here follows by plugging in the values for T, R and e. 
By linearity of expectation, it follows that 



Bi\{v G C:Xt{v) / Yt{v)}\) < |C|e/8, 
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for all sufficiently large n. Applying Markov's inequality to the random vari- 
able \{v G C:Xt{v) 7^ now yields our first desired inequality (12). 

Again fix u G C, and let us restrict our attention to the ball B^^i{v). 
Let TTo-y denote the stationary distribution vr on conditioned on 

agreeing with afj. Let fiy = Pr(Y{v) G Qv), where Y is sampled from vTo-y. 
Note that the dynamics (Yt) is reversible with respect to this distribution 
(although it need not be ergodic, because of the fixed boundary). Note also 
that the initial distribution Yq is tTo-^ conditioned on Yq{v) £ Qv Hence by 
Lemma 3.5 applied to (Yt) on the ball B^^i(v), we have 

Pr{YT{v) e Q„) >fi^ + {l- fi^) exp(-r/(l - /x^)). 

There is a subtle technical point in the preceding argument: since we are 
working in continuous time, and because the boundary of Br-i{v) is fixed, 
the projection of (Yt) onto Bfj-i{v) is indeed a Glauber dynamics and so 
Lemma 3.5 applies. 

By linearity of expectation, /i = j^J2vec f^v- Also by linearity of expec- 
tation, we have 

E{f{YT)) = r^Y.^r{YT{v)eQ,) 

- + TTm (1 ~ exp(-T/(l - fi^)). 
I*" I vec 

Since the function xexp(—t/x) is convex on (0,1) as a function of x, for 
fixed t > 0, it follows by Jensen's inequality that 

(14) WiYT)) > + (1 - /i) exp(-r/(l - ^)). 

We now claim that the right-hand side of (14) is at least p, + e. To see 
this, note by the definition of p, that, when > 1/2, we have fi = fx + e. 
On the other hand, when ^ < 1/2, we have (1 — /i)exp(— r/(l — /i)) > 
l/(2exp(2r)) = 2e and /i > /I - e. We conclude from (14) that E(/(yr)) > 
fl + e. 

Since /(Yt) is the average of |C| independent random variables taking 
values in {0, 1}, Chernoff's bound yields 

Pr(/(yr) < /I + e/2) < Pr{f{YT) < B{f{YT)) - e/2) 
<exp(-|C|eV8). 

Writing \C\ and e explicitly in terms of n shows that the right-hand side is 
asymptotically as n— > oo, establishing (13). 
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4.2. Proof for the general case. The proof of the general version of The- 
orem 4.1 will rely on the following fact about spin systems with hard con- 
straints. 

Lemma 4.2. Let v and let R>A be an integer. Let (Xt) and (Yt) be 
continuous-time Glauber dynamics such that the initial configuration Xq G 
is frozen at all sites in Br{v), and the initial configuration Yq differs from 
Xq at V. If T < i?/(5e^AlnA), then the distributions of Xx and Yt are at 
total variation distance at least 

l-2exp(-/?/(31nA)). 

Proof. Let < r < R he another positive integer. By Lemma 3.3 ap- 
phed to (Xt) [setting A = V \ Br{v) and A' = Br{v)], the probability that 
Xt agrees with Xq on Br{v) is at least 1 - \5 A' \{eT /\ / {R - r + l))«-^-+i > 
1 - A(A - 1)^-1 (erA/(i? - r + 1))^-''+^ . On the other hand, by applying 
Lemma 3.4 to {Yt) [setting A = V\Br{v) and A' = {«}], the probability that 
Yt agrees with Xq on Byiv) is at most (eTA/(r -I- 1))'''''^. By the triangle 
inequality, the distributions of Xt and Yt must have total variation distance 
at least 

1 - A^(erA/(fl - r + 1))^-^'+^ - (erA/(r + 1))^+^ 
Letting r = [ii/31n(A)J < i?/2, the result follows by basic algebra. □ 

We are now ready to present the proof of Theorem 4.1 in the presence of 
hard constraints. The broad outline of the proof follows that of the previous 
one where there were no hard constraints; we will focus on the points where 
differences occur. In particular, the selection of the set of centers C, the spin 
sets Qv, and the initial assignment au to the sites at distance > R from C, 
become slightly trickier. 

Let R = \ previous proof. 

Case 1. Suppose there exists a set C C F of size at least [?i/A^^] = 
0(n^/^) at pairwise distance at least 2i?, together with an initial assign- 
ment au to all the sites U at distance > R from C, such that, conditioned 
on the configuration agreeing with au, every v EC has at least two feasible 
spins available. 

In this case, for each v £ C, define Qi, to be a proper nonempty subset 
of the feasible spins for v, conditioned on agreeing with au- Then the rest 
of the proof goes through as in the previous subsection, but with slightly 
worse constants. 

Case 2. No sufficiently large set C as above exists. 

In this case, let C and au be as above, with \C\ < n/A'^^ and C maximal. 
The upper bound on |C| implies that there must exist at least one site v at 
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distance > 3R from C. So, in particular, every site in Bii{v) is at distance 
> 2R from C. It follows that all of Bii{v) must be frozen under au] otherwise, 
we could add any one of its nonfrozen sites to C, contradicting maximality. 

To complete the proof, we argue differently from the previous case: we 
exhibit two initial configurations Xq and Yq such that when both evolve 
according to the Glauber dynamics for time T < 7 log n (for sufficiently small 
7 > 0), Xt and Yt are still at large variation distance. Hence the mixing 
time must be at least T. Let Xq be any configuration extending au, and 
let Yq be any configuration which disagrees with Xq at v. (Such a Yq must 
exist by our nonredundancy condition.) By Lemma 4.2, the mixing time is 
0(i?/(AlogA)) = 0(logn/(Alog2A)). 

This completes the proof of Theorem 4.1 in the general case. 

Remark 4.3. In both cases in the above proof, the dependence of our 
lower bound on A is 0(l/Apolylog A). In the next section, we will prove an 
upper bound whose dependence on A is 0(1/ log A). It would be interesting 
to close this gap and establish a tight dependence on A. 

5. Unbounded degrees and other variations. In this final section we dis- 
cuss various extensions of our main theorem, and also show that certain 
other extensions (notably, removing the assumption of bounded degree) are 
not possible. 

5.1. Graphs with unbounded degree. We have shown a lower bound of 
il(nlog?i) on the mixing time of Glauber dynamics on any family of graphs 
of bounded degree. More precisely, for any fixed A, we have shown that 
the Glauber dynamics on graphs of maximum degree at most A has mixing 
time at least CA^^lnn for some positive constant Ca. Is the restriction to 
bounded-degree graphs necessary? 

We first give a simple example to show that it is. Let G = Kn, and consider 
the hard-core model on G, that is, the spin space is Q = {0,1}, and the 
feasible configurations are those in which at most one vertex has spin 1 
(corresponding to the independent sets of G). The distribution tt assigns 
probability ^ to the all-0 configuration, and ^ to each other configuration. 
(Thus the activity parameter in the hard-core model is A = Consider the 
following Metropolis Glauber dynamics, which at each step picks a vertex v 
u.a.r. If V has spin 1, then its spin is flipped to with probability ^, while 
if V and all other vertices have spin 0, then the spin of v is flipped to 1 
with probability Plainly the mixing time of this Glauber dynamics (in 
discrete time) is 0{n). 

The following more interesting example demonstrates a trade-off between 
maximum degree and mixing time. Let Gq = (VjEq) be any graph of maxi- 
mum degree d < \/A. Let G = {V, E), where E consists of all pairs of vertices 
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whose distance in Go is 1 or 2 (note that the maximum degree of G is at 
most d? < A). We will construct a Glauber dynamics on G which mimics 
any reversible random walk on Gq. To this end, for every edge {tt,f} G Eq 
we let A[u,v) > be the transition probability from u to f in such a ran- 
dom walk, A{v,v) > be the probability of a self-loop at u , and ttq be the 
stationary distribution. For convenience, in what follows we augment Eq to 
include the self-loop {v^v} whenever A(v,v) > 0. 

Our Glauber dynamics will have spins Q = {0, 1, 2}. The feasible config- 
urations (t: y — > Q will all satisfy J2v&v ^i'^) ^ {^i^}. Moreover, when this 
sum is 2 the vertices with nonzero spin will be the endpoints of an edge 
{u,v} (possibly a self-loop) in Eq. Thus we may identify feasible configura- 
tions with the set V L) Eq. 

We now describe the update rule for the Glauber dynamics. Let a be 
the current configuration, and let v be the site selected for updating. The 
new configuration a' is determined as follows (where ~ denotes adjacency 
in Go): 

1. Case 1: a{v) = 0. 

(i) If there exists w ^ v, a{w) = 1, and for all z ^ w, a{z) = 0, then: 
with probability A{w,v), set (7'{v) = 1. 

(ii) Otherwise, set cr'{v) =0. 

2. Case 2: a{v) = 1. 

(i) If there exists w ^ v, a^w) = 1, then set (j'{v) G {0, 1} according 
to the result of a fair coin toss. 

(ii) Otherwise, with probability A{v,v), set cr'{v) = 2. 

3. Case 3: a{v) = 2. Set a'{v) = 1. 

Since the update rule only examines vertices at distance < 2 from t; in Go, 
this is a local dynamics on the graph G. 

Informally, the Glauber dynamics may be described as follows. As indi- 
cated above, we may think of the configurations of the dynamics as either 
vertex states, ay for some f G F, or edge states, ayw for some {v,w} G Eq. 
Call a transition of the Glauber dynamics successful if it causes a change in 
the configuration. Each successful transition causes a change from a vertex 
state to an edge state, or vice versa. Starting from a vertex state ay, the 
first successful transition moves to an edge ayy, (where w = v is possible); 
the next successful transition undoes this move with probability ^, and oth- 
erwise moves to the vertex state a^,, thus completing the simulation of a 
single move of the original random walk. 

It is easy to check that the stationary distribution of the dynamics is 
vr = ^{tt^ + TT^), where vr^ is concentrated on vertex states and satisfies 
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7r^((T„) = vro(f ) for all v ^V, and vr^ is concentrated on edge states and satis- 
fies TT^{(7y^) = irQ{v)A{v, w) + 7ro{w)A{w, v) = 27ro{v)A{v, w) for {v, w} G Eq 
with V ^w, 7r^{ay^) = TrQ{v)A{v,v) for self-loops. Since A is reversible this 
dynamics is also reversible. Thus we have a valid Glauber dynamics. 

Lemma 5.1. The above Glauber dynamics has mixing time 0{nTA), 
where ta is the mixing time of the underlying random walk defined by A 
and n = \V\ . 

Proof (Sketch) . Let (X^q , Xt^ , . . . , Xt^. , . . .) denote the Glauber dynam- 
ics observed after each successful transition, with to = and Xt^ assumed to 
be a vertex state. Then, from the informal description given earlier, {Xt^^) 
is a (reversible) Markov chain on the vertex states, with transition matrix 
^{I + A)^ where / is the nxn identity matrix. Thus, denoting the distri- 
bution of Xt^. by I'k, if we take k > Cta for a sufficiently large constant C, 
then if k is even we have i^^ = vr^ + £k where ||efc||i ^ ^ for any desired posi- 
tive constant e. From the definition of the dynamics, this then easily implies 
Vk = + for k>CTA, k odd, as well, with ||eA;||i bounded similarly. 

Now let {Xq,Xi, . . . ,Xt, . . .) denote the full Glauber dynamics, and let fit 

denote the distribution of Xt. Then fit satisfies fit = J2kPk^^k, where p^^^ is 
the probability that exactly k successful transitions occur within the first t 
steps of the dynamics. Note that the probability of any given transition being 
successful is exactly ^. Thus if we take t = C'uta for a suitable constant C", 

we may ensure that Y.k<CrAPk < ^ and | Efc>Cr^, k e^cnPf This 
in turn implies 

i^t= y: E pf-k=\{^''+^'')+iu 

k<CTA k>CTA 

where ||7t ||i < 3e. □ 

Our motivation for the general construction above is the following special 
case, which illustrates a trade-off between maximum degree and mixing time. 
This is the result we claimed in Theorem 1.2, which we restate here for 
convenience. 

Theorem 5.2. For each n, let A(n) be any natural number satisfying 
2 < A(n) < n. Then there exists a family of graphs G = G{n), where G{n) 
has n vertices and maximum degree A(n), and an associated Glauber dy- 
namics on G{n) with mixing time 0(nlogn/log A(n)). 

Proof. Let A = A(n) and set d = [a/AJ . Without loss of generality we 
may assume (i > 3, as there are many examples of Glauber dynamics with 
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mixing time 0(n log n) on arbitrary graphs. Let Gq be a complete {d — 1)- 
ary tree with n vertices; note that the height of Gq is h = 0(logn/ logd). 
Let A be the biased random walk on Gq which has, at each step, probability 
2/3 of moving to a parent (except at the root), and 1/3 of moving to a 
random child (except at a leaf). It is easily seen (by projecting onto a one- 
dimensional process that walks between the levels of Gq) that the mixing 
time of this random walk is 0(/i) = 0(logn/log(i). Therefore, by Lemma 5.1, 
the Glauber dynamics constructed as above from this random walk has mix- 
ing time 0(n log n/ log d) = 0(nlogn/log A) and degree (P < A, as required. 
(The maximum degree can always be increased to exactly A by adding edges 
as necessary.) □ 

Remark. It is in fact possible, with a little more work, to massage the 
above tree example into one in which the Glauber dynamics is based on a 
Markov random field. 

5.2. Nonuniform update probabilities. In our main theorem, we assumed 
that the site to be updated was chosen uniformly at random (or equivalently, 
in continuous time, all sites are updated at rate 1). How essential is this 
assumption? 

First, it is not hard to check that our proof still goes through more or 
less unchanged under the weaker assumption that no site has a probability 
larger than 0{l/n) of being chosen. On the other hand, if we allow arbi- 
trary site selection probabilities, the theorem no longer holds. To see this, 
consider again the tree example from Section 5.1, with, for example, d = 3, 
but now suppose that vertex v is chosen with probability proportional to 

(d — l)'^/^, where k is the height of v in Gq. Thus the probability of choos- 
ing V is c(d- l)*^/2-h^ g ^ (1 _ i)-i/2)/(i _ l)-{/»+l)/2) > i/4_ 

The updating rule for v is as before. Now by an analysis similar to that of 
Section 5.1, the expected time to reach the root from a leaf is approximately 
proportional to the sum of the waiting times to hit vertices along a leaf-root 
path, which is at most J2k=o c~^{d— l)'^-'=/2 = 0(n). This linear mixing time 
is significantly less than our ^}{nlogn) lower bound for (near-)uniform site 
selection probabilities. 

5.3. Block dynamics. An alternative to Glauber dynamics frequently en- 
countered in the literature is so-called "block dynamics," which, at each step, 
updates all the spins in a randomly chosen "block" of sites. (For instance, 
a block might be the ball of radius 1 around a randomly chosen site.) In 
many cases it is easier to establish rapid mixing for block dynamics (even 
with quite small blocks) than for single-site dynamics (see, e.g., [1, 11, 16]). 
As with the Glauber dynamics, we require that the block dynamics be ir- 
reducible and aperiodic, reversible with respect to vr, and local in the sense 
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that the updates inside the block depend only on the current spins on the 
sites in the block and their neighbors. 

Provided again that G has bounded degree, the blocks are connected and 
of bounded size, and no site is updated at a rate greater than 0(1) (in 
continuous time), our proof of Theorem 4.1 can easily be seen to apply in 
this context as well, giving a lower bound of r2(nlogn) on the mixing time 
of block dynamics. 
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